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PRODUCT-LIMIT ESTIMATORS OF THE SURVIVAL FUNCTION 
WITH TWICE CENSORED DATA 

By Valentin Patilea and Jean-Marie Rolin 1 

CREST-ENSAI and Universite Catholique de Louvain 

A model for competing (resp. complementary) risks survival data 
where the failure time can be left (resp. right) censored is proposed. 
Product-limit estimators for the survival functions of the individual 
risks are derived. We deduce the strong convergence of our estimators 
on the whole real half-line without any additional assumptions and 
their asymptotic normality under conditions concerning only the ob- 
served distribution. When the observations are generated according 
to the double censoring model introduced by Turnbull, the product- 
limit estimators represent upper and lower bounds for Turnbull's es- 
timator. 

1. Introduction. Consider the problem of nonparametric inference with 
competing risks survival data. The novelty we propose is that the failure time 
can be left-censored, for instance, at the time the study starts. For simplicity, 
we consider two distinct competing risks of failure, the extension to more 
than two competing risks being straightforward. Let T and V\ denote the 
latent independent lifetimes for each cause of failure. The failure time is 
min(T, Vi) and it can be censored from the left by a censoring time U\. The 
observations are independent copies of a lifetime Y, a finite nonnegative 
random variable and a discrete random variable A with values in {0, 1,2}, 
where 2 indicates a left-censored failure time, while and 1 correspond 
to an observation equal to T and Vi, respectively. If T is the lifetime of 
interest, we say that Y is a twice censored observation of T. Associated 
with the problem of competing risks is the dual problem of complementary 
risks where the observed failure time is the maximum of the lifetimes for 
each cause of failure (e.g., [1]). The extension we consider here is that the 
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failure time can be right-censored, for instance, at the time the experience 
ends. 

By the plug-in (or substitution) principle applied for the empirical distri- 
bution, the nonparametric estimation of the distribution of a latent lifetime 
of interest is straightforward as soon as this distribution can be expressed 
as an explicit function of the distribution of the observed variables. The two 
models we propose in this paper allow for explicit inversion formulae, that is, 
the latent distributions of interest are explicit functionals of the distribution 
of the observations. 

In Section 2 we introduce our latent models, while in Section 3 we provide 
the inversion formulae. In Section 4 we compare our model with the doubly 
censored data latent model proposed by Turnbull [14]. We show that the 
inversion formulae provide lower and upper bounds for the distribution of 
interest identified by Turnbull's model. Applying the inversion formulae to 
the empirical distribution, we deduce in Section 5 the product-limit estima- 
tors. In Sections 6 and 7 we deduce the almost sure uniform convergence 
and the asymptotic normality for our functionals. 

2. Latent variables models. The random variables we consider take val- 
ues in M. = [0, oo] endowed with B + the Borel cr-field. If X is such a variable, 
Fx denotes its distribution. 

For the first latent model considered (call it Model I), let T and V\ 
be two lifetimes and let f/i be a left-censoring time. Assume that T, V\ 
and Ui are independent. Suppose that Y and A are observed, where Y = 
max[min(r, Vi), U\] and 

TO, HUxKTKVx, 
A=\l, HUi<V l <T, 

{2, if min(T,Vi) < U x . 

Define the observed subdistributions of Y as 

H k (B) = P[Y EB,A = k], k = 0,1,2, 

where B is a Borel set in [0, oo] ; the distribution of Y is H = Hq + H± + H2. 
In Model /, the subdistributions of Y can be expressed in terms of the 
distributions of the latent variables as follows: 

H (dt)=F Ul ([0,t))F Vl ([t, 00]) F T (dt), 
(1) H 1 (dt)=F Ul ([0,t))F T ((t,oo])F Vl (dt), 

H 2 {dt) = {1 - F r ((t,oo])Fy 1 ((t,oo])}F C7l (dt) 

[necessarily # ({0}) = #i({0}) = 0]. If S 1 = min(T, V x ) and H 01 = H + H 1 , 
the three equations imply 



(2) H 01 (dt)=F Ul ([0,t))F Sl (dt), H 2 (dt) = F Sl ([0,t])F Ul (dt). 
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This indicates that the problem of inverting the model, that is, expressing 
the distributions of the latent variables in terms of the subdistributions of 
Y, can be solved in two steps. First, determine the distributions of U\ and 
S\ as in an independent left-censoring model. Next, use these distributions 
and the first equation in (1) to recover the distribution of T. 

As an application of Model 7, consider a reliability system which consists 
of three components Ui,T and Vi, with T and V\ in series and U\ in parallel 
with this series system (see, e.g., [8], Chapter 15). The lifetimes of Ui, T 
and Vi are independent and when the system fails we are able to determine 
which component failed at the same time as the system. Morales, Pardo and 
Quesada [9] propose the application of this model to study a certain cause 
of death for trees on a farm. 

For our second latent model (call it Model 77), let U2 and T be two 
lifetimes and let V2 be a right-censoring time. Suppose T, U2 and V2 are inde- 
pendent. The observed variables are Y and A, where Y = min[max(T, U2), V2] 
and 



In Model 77, the relationship between the subdistributions of Y and the 
distributions of the latent variables is described by the equations 



[necessarily Hq({0}) = 0]. If S2 = max(C/2, T) and 77q2 = Hq + H2, we obtain 



(4) H 2{dt) = F V2 {[t,oo])F S2 (dt), Hi(dt) = Fs 2 {{t,^])F V2 (dt). 



These relations show that Model 77 can be inverted in two steps. First, as 
in an independent right-censoring model, recover the distributions of V2 and 
S2 from 77o2 an d 77i. Second, use the distributions of V2 and S2 and the 
first equation in (3) to determine the distribution of T. 

Model 77 can be interpreted as follows: consider a system consisting of 
three components U2, T and V2 with independent lifetimes. Put T and U2 
in parallel and V2 in series with this parallel system (see also [2], page 767). 
Again, assume that we are able to determine which component failed at the 
same time as the system. 

3. Inversion formulae. Recall that if F is a probability distribution on 
(M + ,IT , ~), the associated hazard measure is L([0,t]) = — In F({t, 00]). Two 
more hazard measures can be defined, 
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0. 
1. 
2. 



if U 2 < T < V 2 , 

if V 2 <max([/ 2 ,T), 

if T < U 2 < V 2 . 



(3) 



77 (dt) = F U2 ( [0, t) )F V2 ( [t , 00] )F T (dt ) , 
Hi(dt) = {1 - F T ([0,t])F U2 ([0,t])}F V2 (dt) 
77 2 {dt) = F T ( [0, t])F Va ( [t, oo^Fu, (dt) 



L~[dt) 



F(dt) 



and L + (dt) 



F(dt) 



F([t,oo]) 



F((t,oo]) 
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which we call the predictable and the unpredictable hazard measure, re- 
spectively. The three hazard measures have the same continuous parts. 
Moreover, their point masses are in bijection: L({t}) = — ln[l — L~ ({£})] = 
ln[l + L + ({£})]. The probability distribution F can be expressed as 



F((t,oo]) =exp{-L([0,t])} = 7T(1 - L-{da)) 



71(1 + L + (ds)) 



-\ -i 



[o,t] L P.*] 

where 71 is the product-integral (e.g., [6]). The mass of L at infinity is 
irrelevant for F and F({oo}) = exp{— L([0, oo))}. 

Similarly, by reversing time, the reverse hazard measure associated to 
F is Af((t, oo]) = — ln.F([0, t]). Moreover, the predictable and unpredictable 
reverse hazard measures are defined as 



F([0,t\) F([0,t))' 

respectively. The three reverse hazard measures have the same continuous 
parts and their point masses satisfy M({t}) = — ln[l — M - ({£})] = ln[l + 
Af+({t})]. We have 



n -l 

F([0,t])=exp{-M((t,oo])}=7t(l-M-( < fe)) 

(t,oo] 



7t(l + M + (ds)) 

(t,oo] 



The mass M({0}) is irrelevant for F. Moreover, F({0}) = exp{-M((0, oo])}. 

Given a nonnegative measure on (M + ,S + ), we can always define a prob- 
ability distribution on the same space by considering this measure as be- 
ing one of L, L~ or L + (resp. M, M~ or M + ) and using the relations 
above. For instance, in the independent right-censoring model, one defines 
L~(dt) = Ho(dt)/H([t, oo]), with Hq the subdistribution of the uncensored 
data. Then, by the equations of the model, the distribution corresponding 
to this L~ is nothing else than the distribution of the lifetime of interest. 
The reverse hazard measures M, M~ and M + are the counterparts of L, 
L~ and L + to be used in left-censoring models. 

We can invert our models using the hazard measures above. Since, apart 
from mild conditions at the origin, the inversion formulae below apply to 
any subdistributions (Hq,H\,H2), we deduce them without any reference 
to the latent variables. 

For inverting Model /, assume Hq({0}) = Hi({0}) = 0. In view of (2), 
proceed as for inverting a left-censoring model and define the predictable 
reverse hazard measures 

(5) M 2 -(dt) = 4^-, M m (dt)- Hm{dt) 



H([0,t])' U1V ; H([0,t))+H 01 ({t}) 
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and let Frf and F^ be the corresponding distributions. By this defini- 
tion, we have H([0,t]) = F 2 7 ([0,£])F / 1 ([0,i]). In the second step of the in- 
version, note that the first equation in (1) and the definition of Si im- 
ply iJo(^)/% 1 ([0,t)) J P 5l ([t,oo]) = F T (di)/F T ([f,oo]). This suggests defin- 
ing the predictable hazard measure 

T i-(j + \_ H (dt) 

Let Fj, be its associated distribution. 

For Model II, assume Hq({0}) =0. Look at the relation (4) and, exactly 
as in a right-censoring model, define the predictable hazard measures 



ff([«,oo])' " ' H((t,co]) + H 1 ([t}) 

Let Fq2 and Fj 1 denote the corresponding distributions. Clearly, H((t, oo]) = 
Fj 1 ((t, oo])Fq2 ((t, oo]). In the second step of the inversion, by the first equa- 
tion in (3) and the definition of S 2 , H (dt)/{F V2 ([t,oo])F S2 ([0,t)) + H ({t})} = 
FT(dt)/Fr([0,t]). Consequently, define the predictable reverse hazard mea- 
sure 

(7) M$-{dt)- H ° {dt) 



Ff([t,oo])F ^([0,t)) + iIo({n) 

and let Fjf be its associated distribution. 

Now, consider the identification problem. If Model I is correct, we look 
for conditions ensuring that Fj, = Ft on ]R + . Define the support of fi, a 
nonnegative measure on [0, oo], as supp(n) = {t : /x([0, t])fj,([t, oo]) > 0}. Let 
B 1 ={t: Fj^QO, t))F Vl ([t, oo]) > 0}. Deduce from (6) that the support of hlf 
is equal to the support of H . As supp(H ) =B\ f] supp(Frp), 

F T = F T onR + supp(F T ) C B x . 

By similar arguments, if Model II is correct, Fjt = Ft on M + if and only if 
supp(Ft) C {t : F Ua ([0, t))F V2 ([t, oo]) > 0}. 

4. Comparisons with the doubly censored data model. The models we 
propose are closely related to the model for doubly (left and right) censored 
observations introduced by Turnbull [14]. In Turnbull's model the lifetime T 
is independent of the censoring variables (L, R) and L < R. The observations 
are independent copies of Y and A, where 

Y = max[min(T, R), L] = min[max(T, L),R], 

0, if L < T < R (no censoring), 
A = { 1, if {L<)R<T (right censoring), 
if T < L(< R) (left censoring). 
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If Hk{dt) = P(Y G dt,A = k), k = 0, 1,2, the equations of the model are 

H (dt)={F L ([0,t))-F R ([0,t))}F T (dt), 

(8) H 1 (dt)=F T ((t,oo])F R (dt), 
H 2 (dt) = F T ([0,t})F L (dt). 

Note that the assumptions of the model imply 

(9) H([0,t]) = F L ([0,t])F T ([0,t]) + F R ([0,t])F T ((t,^]). 

In Turnbull's model T is censored from the left by L and from the right 
by R and the observation Y is always the variable in the middle. This is 
different from the censoring mechanisms we consider: in Model I the variable 
min(T, V\) is left-censored, while in Model // the variable max(U2,T) is 
right-censored. 

Turnbull [14] proposed a nonparametric maximum likelihood estimator 
that can be obtained as the implicit solution of the equations (8). The im- 
plicit definition of Turnbull's estimator makes its asymptotic properties quite 
difficult (see [7]). Moreover, a numerical algorithm is needed for the appli- 
cations. 

We are interested in the relationship between our F T , F T T and F T iden- 
tified by Turnbull's model. In fact, for any subdistributions Hq, H\ and H2 
with tf ({0})=#i({0})=0, 

F T ( [0, t] ) < F T ( [0, t] ) < Fi 1 ( [0, t] ) V t > 0, 

where Ft is the distribution of T identified by Turnbull's model. Indeed, in 
Model I use definition (6) and H([0, t]) = F 2 7 ([0, t])F^ ([0, t]) to write 

1 ^ Fi([0,t))-H([0,t)) 
In Turnbull's model [relations (8) and (9)] we have 

H (dt) 



L^(dt) 



F L ([0,t))-H([0,t)Y 
Next, the definition of M 2 ~ , the last equation in (8) and equation (9) imply 

21 ' F t ([0,(])F T ([0,t]) + F R ([0,t])Fr(((,oo]) lV ' 

Deduce that the measure is smaller than the measure M£ . Therefore, 
Fj([0,t)) > F L ([0,t)), Vt > 0. Hence, the measure L T is smaller than the 
measure L T , which implies F T ([0,t]) < F T ([0, t]), Vt > 0. 

On the other hand, for Model //, use the general relationship between 
M+ and M~ , the definition (7) and H((t, 00]) = F/ 7 ((t, oo])F^((t, 00]) and 
write 

M T I+ (dt) - Ho{dt) 



F{'([t,co])-H([t,co\) 
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Meanwhile, in TurnbulFs model, 

H (dt) 



M+(dt) 



T ^> F R ([t,oo]) - H([t,oo])' 

Next, use the definition of the general relationship between L + and L~ , 
the second equation in (3) and the equality H((t, oo]) =FL((t, oo\) Ft ([0, t\) + 
Fji((t, oo])-Fr((t, oo]) [this is a consequence of (9)] to deduce 

11 ; F L ((t,oo})F T ([0,t})+F R ((t,oo})F T ((t,^}) * 

Clearly, the measure Lf is smaller than the measure and, therefore, 
Ff 1 ([t,oo]) > oo]), Wt > 0. Hence, the measure M T I+ is smaller than 

the measure M£ and this implies F^ J ([0,t]) > F T ([0, £]), Mt > 0. 

5. Product-limit estimators. If we replace in the expressions of F T and Fjf 
the subdistributions Hq, Hi and H2 by their empirical counterparts, we ob- 
tain the product-limit estimators FLj, and F^ T , respectively. For this, denote 
by {Zj : 1 < j < M} the distinct values in increasing order of 1^ in a set of 
independent identically distributed (i.i.d.) observations {(Yi,Ai) : 1 <i< n}. 
Define 

D kj = 1 {Y t =Z J ,Ai=k}> N j = zZ HYiKZj}, N j = 1 {Y l >Z J }, 

l<i<n l<i<n l<i<n 

k = 0, 1, 2. With these definitions, the product-limit estimator of Ft in Model 
I is 

where 



n {1-^}. 



j<k<M 

The product-limit estimator of Ft in Model II is given by 



F* T ([o,z j })= n I 1 



j<k<M 

where 



Vfc - N k + D, 



OA' 



V j = n J] {l 



When the doubly censored data model is considered, our product-limit es- 
timators represent lower and upper bounds for Turnbull's estimator. These 
bounds may serve for the numerical algorithms used to compute Turnbull's 
estimator. 
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6. Strong convergence. We study the strong (almost sure or a.s.) uni- 
form convergence of F^ T and F^ T . Since, in fact, the estimators F^ T and 
F^j, are built as explicit functionals of the empirical distribution, we deduce 
their asymptotic behavior, in particular, the strong convergence, whatever 
the properties of the underlying censoring mechanism are. Hereafter, we use 
the following rule: the subscript n indicates the empirical version of the 
quantities we consider. Moreover, if fj, is a nonnegative measure on (R , £> ) 
and / is a measurable function, fi(f) = J f(t)/j,(dt). 

For the strong convergence, we recall a result of Rolin [12], an extension 
of the strong law under right-censorship proved by Stute and Wang [13]. 
Let H = J2i<r<g Hr be a probability distribution decomposed into g subdis- 
tributions. If J C K = {r : 1 < r < g}, let H x = J2rei H r- For J k C l k C JC, 
k = 1, 2, define 

L - (dt) H jMt) 
Lk[dt) ~ H({t,oo])+H Xk ({t}) 

and consider the measure G{dt) = exp{— L2([0,t))}L^ (dt). 

Theorem 6.1. If G(f) < oo, then G n (f) — > G(f) a.s. and in the mean. 
The same result holds if we define the predictable reverse hazard measures 

M-(dt)- H ^( dt ) fc-12 

M * {dt) -H([0,t)) + H Ik ({t}y ' ' 

and consider the measure G{dt) = exp{— M2((t, oo])}M-f(di). 

Let us extend the number of hazard measures associated with Model / 
by defining 

/ r \ H (dt) / i \ HUdt) 

M o (dt)= — r- ° ' „ , Mf (dt)~ 



H([0,t)) + H 01 ({t}Y ^ ' ff([0,t))+ffi({t})- 

Consider Fq,F{, the corresponding distributions. Deduce that Mq\ = Mq + 
Mi , where Mq , M\ and Mq\ are the reverse hazard measures associated with 
Mj~,M{~ and M^ [see (5)], respectively. In view of equations (2), deduce 
H([0,t)) =F 2 I ([0,t))F I 1 ([0,t)) and H 01 ({t}) = i#([0, <))!&({*}). Therefore, 

H (dt) = Ft ( [0, t) )F I 1 ( [0, *])M - (dt) . 

Consequently, in the expression of the predictable hazard measure defining 
Fj, [see (6)], we get rid of F 2 J and obtain 

L^(dt) = ^Ml-M -(dt). 
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Theorem 6.2. If f is a nonnegative Borel measurable function de- 
fined on (K + ,B + ) such that L^T(f) < oo, then, almost surely as n — > oo, 

4K/)-4~(/)- 

Theorem 6.2 is a direct consequence of the following lemma. 
Lemma 6.3. (i) If L^T(f l, o t -i) < oo and Fj 1 ([t,oo]) > 0, then a.s. 

4r(/%t])^4~(/ 1 [0,t])> n^CX). 

(ii) 7f Ly - (/lr t)00 i) < oo and ^([0, i)) > 0, then almost surely as n^oo, 
4r (/%,<»]) -> 4" (/%<»])■ 

Proof, (i) First, Theorem 6.1 implies that any empirical distribution 
function defined by the empirical reverse hazard measures of Model I con- 
verges uniformly on [0, oo] . Now, 

F j{ r (s) TA ([0,s])M- (ds; 
(o,t] ^oi([ s >°°]) 



L nT(/ 1 [0,t]) 

ll-^nOl ~ ^01 



< 



^Ol(k°°D Ao,t] °o]) 



F n V([0, S ])M- (d S ). 



The second member of the inequality tends to zero almost surely because 
F^ 01 ([t, oo]) — > Fg 1 ([t, cx)]) > a.s. and, by Theorem 6.1 applied for G(ds) = 
exp{-Af i((s,oo])}M -(ds), 



/(*) 



-F n / 01 ([0, S ])M- (d S )^4-(/l 



[o,*]> 



a.s. 



'(o,t] *oi([ s >°°])' 

(ii) First, looking at the definition of M^, by a simple computation, 

% s,oo <F 01 s,oo < -— -. 

floi(l0,ooj) 

Using definition (6) for the predictable hazard measure defining Fj,, we have 

f(s) H n0 (ds) 



F n2 ~ F * 



< 



< 



[t ,oo]Fi([0,s))Fi 01 ([s,oo]) 
f(s) H n0 (ds) 



F^([0,t)) J[t,ao] Fi([0,s)) F n V([ S ,oo]) 



F n2 ~ 



f(s) H n0 (ds) 



F' 2 ([0,t)) J [t ,oo] Fj([0,s)) H n01 ([s, oo])' 
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Now, almost surely F^ 2 ([0,t)) — > ([0,t)), which is strictly positive. Since 

/ Jrmv tTTT^ ^ ^oi([0,oo])- 1 4-(A [t , oo] ) < oo, 
J [t,oo] F< ([0, s)) H 01 ([s, oo\) l ' J 

a new application of Theorem 6.1 provides the result. □ 

Denote by iofc the left endpoint and by t\k the right endpoint of the sup- 
port of k = 0, 1, 2. We have the following corollary of Theorem 6.2. Note 
that the strong uniform convergence of F^ T is obtained without any addi- 
tional assumption, apart from that of i.i.d. observations and the condition 
F ({0})=#i({0})=0. 

Corollary 6.4. (a) If Lif([0, tio)) < oo, then, almost surely, 

sup |4 T ([o,t])-4([o,t])Ho 

0<t<t 10 

and L^ T ({iio}) — > 4({iio})- If Lt~([0, tio)) = °°> then, almost surely, 
sup |4 T ([0,a])-4([0,fl])|->0 

0<s<i 

/or a// 1 < t\o and L^ r ([0,iio)) — ► oo. 

(b) AZmost surely, \\F^ T - Fj.\\ = swp^^ \F^ T ([0,t}) - F^([0,t])\ -> 0. 

Proof. The Glivenko-Cantelli theorem provides the result in (a) with 
Lj- and L^ T replaced by Llf and 4t> respectively. The similar result 
for the hazard measure L^ T is obtained by taking care of the fact that 
4({*io}) = co if Ly~({iio}) = 1. This happens if tio > tu, i?o({*io}) > 
and Hi({tio}) = 0. The convergence of F^ T is implied by the convergence of 
the associated hazard measure L^ T . □ 

The strong uniform convergence of F^ T can be obtained in a similar way. 
Define 

!,-(*)- L,-(*)- 



fl"([t,oo])' 21 ' ff(((,oo]) + /r,({(}) + /r 2 ({(}) 

and consider Fq,F 2 , the corresponding distributions. After some manipu- 
lations we can get rid of F^ in the definition (7): 

Next, apply Theorem 6.1 (see [10] for the details). 
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7. Asymptotic normality. Let (D[a, b],\\ • ||) be the space of cadlag 
functions denned on [a, b] C [0,oo], endowed with the supremum norm. 
BVc[a, b] C D[a, b] is the set of cadlag functions with total variation bounded 
by C. The integrals with respect to functions which are not of bounded varia- 
tion have to be understood via partial integration. Finally, weak convergence 
is denoted by ~» and is in the sense considered by Pollard [11], that is, D[a, b] 
is endowed with the ball cr-field. 

Given the explicit form of FLj, and F^ T , a convenient approach for proving 
weak convergence is the delta method (e.g., [5] and [15], Section 3.9). For 
proving Hadamard differentiability, the denominators appearing in the maps 
used to define Fj> and Fjf should stay away from zero. Therefore, we have 
to complete the delta method with a tool for treating the endpoints of the 
intervals on which weak convergence is finally proved. 

Lemma 7.1 ([11], page 70). Let X,Xi,X2, ■ ■ ■ be random elements of 
(D[a, b], || • ||) with the distribution of X concentrated on a separable set. Sup- 
pose, for each e,5 > there exist approximating random elements AX,AX\, 
AX 2 , . . . such that AX n ~> AX, P(\\X - AX\\ > 5) < e and 

(10) UmsupP(\\X n - AX n \\> 5) <e. 

n^oo 

Then X n ~^ X. 

For brevity, we consider only the asymptotic normality of F^ T ; similar 
arguments apply for F^ T . The empirical central limit theorem yields 

V^(H n - H, H 0n -H ,H 2n -H 2 )^ (G, G ,G 2 ) in D 3 ([0, oo]). 

Now, we prove that y/n{M~ 2 — M 2 ~) and \/n(F^ 2 — F 2 ) converge weakly 
to Gaussian limits. The computation of the covariance structures for the 
limit processes in this section is elementary, albeit tedious (see [10] for some 
formulae) . 

Lemma 7.2. Let M 2t = M 2 ~ ((t,oo]) and M~ 2t be the corresponding esti- 
mator. Assume that 



• M 2 (du) _ f H 2 (du) 
W°o]#([0,«]) i(too,oo] #(M) 2 < 
where too = inf{i : -f^o([0, t]) >0}. Then 

(12) ^(H n -H,H n0 -H ,M~ 2 -M 2 )^(G,G ,G M ) in D 3 [i 00 ,oo] 
where (G,Go,Gm) is a zero-mean Gaussian process with 

(13) G Mt = f Jn^W ~ / EFTiTr 1X2 ^2 W- 

J(t,oo] H([0,u\) J(t,ao] H([0,u\y 
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Moreover, if F 2 \ = F 2 7 ([0,t]), then 

Vn~(H n - H, H n0 - H , F n 7 2 - F 2 7 ) ^ (G, G , G 3 ) wi P> 3 [t 00 , oo], 
where (G, Go,^) is a zero-mean Gaussian process with 

(14) G 3t = Fi([0,t}) f - ^5" .. . 

7(t,oo] 1 - M 2 ({ti}) 

PROOF. The map (^4,B)— ► /, ,{1/A) dB is Hadamard-differentiable 
on a domain of the type {(A, B):Ae D[a,b],B £ BV c [a,b],A> e}, C,e > 0, 
at every point such that \/A is of bounded variation. The derivative map 
is given by (a,f3) -> /, ^ (1/A) d/3 - f,^(a/A 2 )dB. Therefore, the delta 

method for the map {H, Hq, H2) — > (H, Hq, M 2 ) yields the weak convergence 
of ^fn{H n - H, H n0 - H , M~ 2 - M 2 ) in D 3 [a, 00} , provided that H([0,a})> 
0. 

For the weak convergence in D s [too j 00 ] ■> consider the pathwise limit of 
Gjv/o- as a [ too, which exists in view of (11). It remains to verify (10) when 
H([0, too]) = 0. It suffices to prove the following: (a) for any e, 5 > 0, there 
exists a = a(e, 5) > too such that 



(15) limsupP sup ^\M~ 2 ([t,a)) - M 2 {[t,a))\> 5) <e; 

n-»oo \U<t<u J 

and (b) - v /nM 2 ~((too, U)) — > 0, in probability, where U = mhijYj. To ensure 
(a), reverse the time and apply the arguments usually used to check the 
"tightness at th = sup{t : H([0, t]) < 1}" when proving weak convergence for 
Nelson-Aalen and Kaplan-Meier estimators (see [3], Theorem 6.2.1, [4]). 
For (b), first note that (11) ensures M 2 ~((too, 00]) is finite. This implies 
F 2 ([0,too]) > (use, e.g., arguments as in Lemma 6 of [6]). Since in general 
M~ is smaller than M, deduce 

M-ll, rnvi,i. mi , H(%V)) , Fjlfoo.t/)) 

(Coo. t0) £ V)) = m < Tjp^jy 

Let = sup{s : \fnF 2 ((too> s)) < A} (see also [16]). We have 
P(v^F 2 7 ((ioo, CO) > A ) < p ( u > u n) = H{{u x n , OO])" 

<{i-p 2 7 ((t o,^])p / 1 ([o,^])r 

/ a 2 i^qo.^D y :Q 

- V n P/((t 00 ,^])y 
The convergence to zero is true because, in view of (11), 

Fi((too,u x ]) < r Fjjds) _ r M 2 -(ds) )Q n ^ Qo _ 
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Now, (b) is clear. For the last part of the lemma, apply the delta method 
for the map A — > 7t(. j00 ](l — A(ds)) defined on BVc[too,<x>], for some C > 0. 
□ 



Remark. In view of the variance of the process G3, it seems possible 
to relax condition (11) when -F/QO^oo]) =0 (see also [4]). However, in the 
following, due to the lack of an obvious martingale structure for L ^ — Llf ' , 
it is convenient to keep the denominator appearing in the definition of Llf 
away from zero when t [ too- For this, we have to impose F\ ([0, too]) > and, 
in this case, (11) is needed to bound the variance of G% t when t J, too- 

Now we state the asymptotic normality for L 1 ^ and F* T . The notation 
A_ means that we consider the left-limits of the process A. 

Theorem 7.3. Suppose condition (11) holds. Let too < r such that i?oi([0, 
t))<1. IfL I T ~ = L I J r([0,t]), then y/n(L%j.- L?f) ~*V in D[0,r], where 

t r f dG 0u f G 3u _ - G u ^ 

is a zero-mean Gaussian process. Moreover, if F^ t = F^([0,t]), then we have 
V^(F^ T -F^)^W in D[0, r] , with W the zero-mean Gaussian process 



W t = F^(t, 00}) f —j 
J(o,t] I — Ll 



dV u 



(o,t] 1 - I/ T ({u}) 

Proof. Since F^ x ( [r, 00] ) > and, by (11), F 2 7 ([0,t o]) > 0, we have 
inf( too T ] (F$ — H)([0, s)) > e, for some e > 0. Thus, if -ffo({*oo}) = 0, the weak 
convergence of \fn{JjJ I — L^) is obtained by the delta method for the map 
(A,B)^f [tooA (l/A-)dB (see [15], pages 382-384). 

When -ffo({*oo}) > (hence, necessarily too > 0), in the definition of Lif , 
we also have to take into account F\ ([0, too)) - F° r this, extend the weak con- 
vergence in (12) on -D 3 [0, 00] by considering a modified predictable reverse 
hazard function 

M n = M;>» H ^ 



J(t,oc] H([0,uVt o\) 



Let M~ 2 be the empirical counterpart. Since the denominator in the last 
display stays away from zero, the weak convergence of y/n(H n — H, H n Q — 
Hq,F^ 2 — F%) in L> 3 [0,oo] is easily obtained by the delta method, where 
now F 2 T , F^ 2 correspond to the modified M 2 ~ , M~ 2 , respectively. Note that 
now F2 ([0, too)) > 0. The processes Gm and G3 are still defined according to 
(13) and (14), respectively. Since the modification of and M~ 2 does not 
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change the definitions of Lfp and L nT , the delta method yields the weak 
convergence of y/n(L^ — L 1 ^). The last part of the theorem is obtained by 
the delta method for the product-integration map. □ 
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